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ABSTRACT 

We introduce Peano words, which are words corresponding to finite approximations 
of the Peano space filling curve. We then find the number of occurrences of certain 
patterns in these words. 
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1. Introduction and Background 

We write permutations as words tt = a\a2 ■ ■ ■ a n , whose letters are distinct and usually 
consist of the integers 1,2, ... ,n. 

An occurrence of a pattern p in a permutation 7r is "classically" defined as a sub- 
sequence in 7T (of the same length as the length of p) whose letters are in the same 
relative order as those in p. Formally speaking, for r < n, we say that a permutation 
a in the symmetric group S n has an occurrence of the pattern p £ S r if there exist 
1 < i\ < 12 < ■ ■ ■ < i r < n such that p — cr(ii)a(i2) ■ ■ ■ cr(i r ) in reduced form. The 
reduced form of a permutation a on a set {ji,j2, • ■ ■ , jr}, where ji < j% < ■ ■ ■ < j r , is 
a permutation a± obtained by renaming the letters of the permutation a so that ji is 
renamed i for all i G {1, . . . , r}. For example, the reduced form of the permutation 
3651 is 2431. The first case of classical patterns studied was that of permutations 
avoiding a pattern of length 3 in S3. Knuth [Knuth] found that, for any t £ S3, the 
number |5 rl (r)| of n-permutations avoiding r is C n , the nth Catalan number. Later, 
Simion and Schmidt [SimSch] determined the number \S n {P)\ of permutations in S n 
simultaneously avoiding any given set of patterns P ^ S3. 

In [BabStein] Babson and Steingrfmsson introduced generalised permutation pat- 
terns that allow the requirement that two adjacent letters in a pattern must be adja- 
cent in the permutation. In order to avoid confusion we write a "classical" pattern, 
say 231, as 2-3-1, and if we write, say 2-31, then we mean that if this pattern occurs 
in the permutation, then the letters in the permutation that correspond to 3 and 1 
are adjacent. For example, the permutation tt = 516423 has only one occurrence of 
the pattern 2-31, namely the subword 564, whereas the pattern 2-3-1 occurs, in ad- 
dition, in the subwords 562 and 563. A motivation for introducing these patterns in 
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[BabStcin] was the study of Mahonian statistics. A number of interesting results on 
generalised patterns were obtained in [Claes] . Relations to several well studied combi- 
natorial structures, such as set partitions, Dyck paths, Motzkin paths and involutions, 
were shown there. 

Burstein [Burstein] considered words instead of permutations. In particular, he 
found the number |[fc]™(P)| of words of length n in a k- letter alphabet that avoid all 
patterns from a set P C S3 simultaneously. Burstein and Mansour [BurMansl] (resp. 
[BurMans2, BurMans3]) considered forbidden patterns (resp. generalized patterns) 
with repeated letters. 

The most attention, in the papers on classical or generalized patterns, is paid to 
finding exact formulas and/or generating functions for the number of words or per- 
mutations avoiding, or having k occurrences of, certain patterns. In [KitMans] the 
present authors suggested another problem, namely counting the number of occur- 
rences of certain patterns in certain words. These words were chosen to be the set of 
all finite approximations of a sequence generated by a morphism with certain restric- 
tions. A motivation for this choice was the interest in studying classes of sequences 
and words that are defined by iterative schemes [Lothaire, Salomaa] . 

In the present paper we also study the number of occurrences of certain patterns 
in certain words. But here we choose these words to be the subdivision stages from 
which the Peano curve is obtained. We call these words the Peano words. The 
Peano curve was studied by the Italian mathematician Giuseppe Peano in 1890 as an 
example of a continuous space filling curve. We consider the Peano words and find 
the number of occurrences of the patterns 12, 21, 1 £ , [x-y e ), [x l -y] and [x-y l -z], where 
x, y, z G {1, 2, 3}, y = y-y- ■ ■ ■ -y (t times), and "[" in p = [x — w) indicates that in an 
occurrence of p, the letter corresponding to the x must be the first letter of the word. 

2. The Peano curve and the Peano words 

We follow [GelbOlm] and present a description (of a curve that fills the unit square 
S = [0, 1] x [0, 1]) given in 1891 by the German mathematician D. Hilbert. 

As indicated in Figure 1, the idea is to subdivide S and the unit interval I ~ [0, 1] 
into 4™ closed subsquares and subintervals, respectively, and to set up a correspon- 
dence between subsquares and subintervals so that inclusion relationships are pre- 
served (at each stage of subdivision, if a square corresponds to an interval, then its 
subsquares correspond to subintervals of that interval). 

We now define the continuous mapping / of / onto S: If x E I, then at each stage 
of subdivision x belongs to at least one closed subinterval. Select either one (if there 
are two) and associate the corresponding square. In this way a decreasing sequence of 
closed squares is obtained corresponding to a decreasing sequence of closed intervals. 
This sequence of closed squares has the property that there is exactly one point 
belonging to all of them. This point is by definition f(x). It can be shown that the 
point f(x) is well-defined, that is, independent of any choice of intervals containing 
x; the range of / is S; and / is continuous. 
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We now consider a subdivision stage (an iteration), go through the curve inside S 
starting in the point 1 (see Figure 1), and coding any movement "up" by 1, "right" 
by 2, "down" by 3, "left" by 4. Thus, we start with the first iteration X x = 123, the 
second iteration is X 2 = 214112321233432. More generally, it is easy to see that the 
n-th iteration is given by 

X n = <fii(X n _i)lX n _i2X n _i3<fi2(X n -i), 

where the function ipi (A) reverses the letters in the word A and makes the substitution 
corresponding to the permutation 4123, that is, 1 becomes 4 etc. The function ip2 
does the same, except with 4123 replaced by 2341. In this paper, we are interested in 
the words X n , for n = 1, 2, . . ., which appear as the subdivision stages of the Peano 
curve. We call these words the Peano words. 

3. The main results 

It is easy to see that the length of the curve after the n-th iteration is \X n \ = 4™ — 1. 
Moreover, the following lemma holds. 

Lemma 1 The number of occurrences of the letters 1, 2, 3 and 4 in X n is given by 
4"-\ 4™- 1 + 2"- 1 - 1, 4"- 1 and 4"- 1 - 2"" 1 respectively. 

Proof. Suppose d™ (resp. d% , d% , d"2) denote the number of occurrences of the letter 
1 (resp. 2,3,4) in the word X n . It is easy to see, using the way we construct X n , that 
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Using the diagonalization of the matrix in the identity above, namely the fact that 





(2 1 l\ 






(-1-1 


01^ 




/o 0\ 


/-1/4 1/4 


-1/4 1/4 \ 






12 10 






1 - 


-1 1 




2 


-1/2 


1/2 






12 1 






-1 1 


1 




2 


-1/2 


1/2 






\1 1 2) 






V 1 


1 1> 




\0 4/ 


V 1/4 1/4 


1/4 1/4/ 




we 


get that the vector (d",c^,d 


3,d"2) is equal to 












-1 1\ 




(I 












/-1/4 1/4 


-1/4 1/4 \ 






1 


-1 1 




2" - 1 










-1/2 


1/2 


1 




-1 


1 1 







2" 


- 1 







-1/2 


1/2 


1 




I 1 


11/ 




U 





(4"-l)/3/ 


V 1/4 1/4 


1/4 1/4/ 


w 


which is 


equal to the vector (4" 


"\4" 


-1 


+ 2"- 1 - 


lj4 n-l j4 n-l _ 


- 2™- 1 )'. 





4 



As a corollary to Lemma 1 we have the following. 
Corollary 2 The number of occurrences of the pattern t — 1-1 1 = l l in X n is 

£ 

equal to 

^4»»-i _ 2™~ 1 \ /4" _1 \ /4" _1 + 2™ _1 — 1 N 

« MJ + 

Proof. The number of occurrences of a subsequence i-i- •••-i m X n , for i = 1,2, 3, 4, is 

obviously given by ) , where is defined and determined in the proof of Lemma 1 . 
The rest is easy to see. □ 

Definition 1 Let r(A) (resp. d(Aj) denote the number of occurrences of the pattern 
12 (resp. 21), that is the number of rises (resp. descents), in a word A. 

Lemma 3 Suppose A = 1X3 and B = 2Y2 for some words X and Y . Then 
r(^(A)) = d(A) + 1, d( V i(A)) = r(A) - 1, r(ip 2 (B)) = d(B) and d(<p 2 (B)) = r(B). 

Proof. If A and B denote the reverses of A and B then r(A) = d(A), d(A) = r(A), 
r{B) = d(B), and d(B) = r(B). 

We consider two factorizations of each word A and B. We can write A as 

A = 3Ai 1 . . . 1 A 2 1 . . . 1 A 3 . . . A k 1 . . . 1, 



where Ai, for i = 1, 2, . . . , k is a word over the alphabet {2, 3, 4}, only Ai can be the 
empty word e, and ij > 1 for j = 1, 2 . . . , k. Also, we can write A as 

A - 3A Q 4_.A A[ 4. .A A 2 . . . A k _ l A^.AA k l, 



where A^ for i = 0, 1, . . . , k is a word over the alphabet {1, 2, 3}, only A and A k can 
be e, and i • > 1 for j = 1, 2 . . . , k. 
The word B can be written as 

B = 2B 1 . . . 1 B 1 1 . . . 1 B 2 . . . 1 ... 1 B t 2, 



where Bi, for i = 0, 1, ...,£, is a word over the alphabet {2, 3, 4}, only Bo and Be can 
be e, and ji > 1 for i = 1, 2 . . . , I. Also, B can be written as 

B = 2B a A_^AB l A_^AB 2 . . . B^A^ABfi, 

J .! J 

J\ J2 Jt 

where for i = 0, 1, . . . , is a word over the alphabet {1, 2, 3}, only B' Q and B t can 
be e, and > 1 for i = 1, 2 . . . , L 
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It follows from the definitions that ifi(A) and <fii(B) (resp. f 2 (A) and ip 2 (B)) are 
obtained from A and B by permuting the letters with the function -k\ (resp. 7r 2 ) that 
acts as the permutation 4123 (resp. 2341). 

We now consider the first factorizations of A and B, and the function m. It is easy 
to see that if W is equal to A i: or B iy or 3Ai, or 2B , or Bg2, then r(W) = r(TTi(W)) 
and d(W) = d(ni(W)), since m is an order-preserving function when it acts from the 
set {2,3,4} to the set {1,2,3}. From the other hand, occurrences of the rises 12, 13 
and 14 (resp. the descents 41, 31 and 21) in A and B, give occurrences of the descents 
41, 42 and 43 (resp. the rises 34, 24 and 14) in -K\(A) and tt\(B) respectively. If we now 
read the first factorizations of A and B from the left to the right, then the occurrences 
of the subwords al alternate with the occurrences of the subwords lb, where a, 6 G 
{2, 3, 4}. Moreover, in the factorization of A, we begin and end with the subword al 
for some a £ {2,3,4}, which gives that d{A) + 1 = r(A) + 1 = r(n 1 (A)) = r(ip 1 (A)) 
and r(A) — 1 = d(A) — 1 = d(-Ki(A)) = d((fii(A)); in the factorization of B, we begin 
with the subword al and end with the subword 16 for some a, b £ {2, 3, 4}, which gives 
that d(B) = r(B) = r(Tn(B)) = r(yi(B)) and r(B)_= d(B) = d(in(B)) = d(<f>i(B)). 

If we consider the second factorizations of A and B, and the function 7r 2 , one can see 
that if W is equal to A\, or or 3Aq, or A' k l, or 2B' Q , or B' e 2, then r(W) = r(ir 2 (W)) 
and d(W) — d(7r 2 (VF)), since 7r 2 is an order-preserving function when it acts from 
the set {1,2,3} to the set {2,3,4}. From the other hand, occurrences of the rises 
14, 24 and 34 (resp. the descents 41, 42 and 43) in A and B, give occurrences 
of the descents 21, 31 and 41 (resp. the rise 12, 13 and 14) in ir 2 (A) and n 2 (B) 
respectively. If we now read the second factorizations of A and B from the left to 
the right, then the occurrences of the subwords a4 alternate with the occurrences 
of the subwords 46, where a, 6 £ {1,2,3}. Moreover, in both cases, we begin with 
the subword a4 and end with the subword 46 for some a, 6 £ {1,2,3}, which gives 
that d(A) = r(A) = r{m{A)) = r{<p 2 (A)), r(A) = d(A) = d{>K 2 {A)) = d(<p 2 (A)), 
d{B) = r(B) = r(ir 2 (B)) = r{<p 2 (B)) and r(B) = d(B) = d(i: 2 {B)) = d(ip 2 (B)). □ 

Theorem 4 Let r n (resp. d n ) be the number of occurrences of the pattern 12 (resp. 
21) in X n . Then for all k>0, 

r 2fe +i = f(4-16 fe + l), 
r 2k+2 = f (16 fe+1 - 1), 
d 2k+ i - f(16 fe -l), 

d 2fc+ 2 = |(16 fe+1 -l). 

Proof. Using the properties of ipi and ip 2 , as well as the way we construct X n , it is 
easy to check by induction, that X 2 fc+i and X 2k + 2 can be factorized as follow: 

X 2k+1 = lX^hl 2Y^2 222^233Z^h, 
X 2k+2 = 2X^h 1 11^211^33 4Z^2 , 

¥>l(^4fc+l) X 4k+1 X ik+1 ¥2(X 4k + 1 ) 
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where X«, F« and Z« are some words for i = 1,2. 

Suppose we know r2k+i and d,2k+i for some k. Since X2k+i = 1W3 for some word 
W, using Lemma 3 and the factorization of the word X2k+2, we can find r2k+2 and 
d 2 k+2- Indeed, fi{X ik+1 ) has d 2 k+i + 1 rises and r 2 k+i - 1 descents; ^ 2 (^4fc+i) 
has c?2fe+i rises and r2k+\ descents; two subwords X2 k +i give 2r2 k +\ rises and 2d,2 k +i 
descents. Besides, we have some extra rises and descents appeared between different 
blocks of the decomposition. They are one extra rise between the letter 3 and the 
subword ^(X^k+i), and 3 extra descents between the subword ipi(Xik+i) and the 
letter 1, the subword X 4k+ i and the letter 2, the letter 2 and the subword X^k+i- 
Thus, r 2 k+2 = 2r 2 fc+i + 2d 2k +i + 2 and d 2k +2 = 2r 2k+ i + 2d 2k +i + 2, which shows, 
in particular, that for even n, in X n , the number of rises is equal to the number of 
descents. 

We now analyze the factorization of X 2k +3, which is similar to that of X 2k +i- 
Using the fact that X2 k +2 = 2W 2 for some word W and Lemma 3, we can find 
r 2k +3 and d 2k +3- Indeed, we can use the similar considerations as above to get 
r2k+3 = 2r 2 fc +2 + 2d 2k +2 + 2 = 8r 2fe+ i + 8d 2k +i + 10 and d 2k +3 = 2r 2 k+2 + 2d 2k +2 = 
8r 2 /t+i + 8d 2 fc+i + 8. Thus, if Xk denote the vector (r 2 fc+i, cfcfc+i) then 



Xk+l = 




with xo = (2,0), since in X\ = 123, there are two rises and no descents. This 
recurrence relation, using diagonalization of the matrix in it, leads us to 

^ = (^(4-16 fe + l),^(16 fe -l))'. 

Finally, r 2fe+ 2 = d 2k+ 2 = 2r 2fe+ i + 2d 2k +i +2= f (16 fe+1 - 1). □ 

Let N T (W) denote the number of occurrences of the pattern t in the word W. 

Using Lemma 1 and the proof of Theorem 4, we can count, for X n , the number of 
occurrences of the patterns t\(x, y) = [x-y l ), T2(x,y) = x e -y] and t$(x, y, z) = [x-y l -z], 
where x,y,z £ {1,2,3}, y l — y-y- ■ ■■-y [l times), and "[" in p = [x — w) indicates 
that in an occurrence of p, the letter corresponding to the x must be the first letter of 
the word, whereas "]" in T${x,y, z) indicates that in an occurrence of t${x, y, z), the 
letter corresponding to the z must be the last (rightmost) letter of the word. 

If we consider, for instance, the pattern n(l,2) = [1-2^) then the letter 1 in this 
pattern must correspond to the leftmost letter of the word X n . Now if n = 2k + 1 
then from the proof of Theorem 4 X n = 1W for some word W, which means that 
to the sequence 2 e there can correspond any subsequence i e in X n , where i = 2,3,4. 
Thus, using Lemma 1 and the way we prove Corollary 2, there are ( 4 ^ 2 ) + ) + 

t ) occurrences of the pattern t\(1, 2) in X2 k +\- If n — 2k + 2 then X n — 2W 
for some word W and for the sequence 2 there correspond any subsequence i e in X n , 
where i = 3,4. Thus, 7V ri(1>2) (X 2fe+2 ) = f k ~? k ) + ( 4 ^). 

In the example above, as well as in the following considerations, we assume I to 
be greater then 0. If t = then obviously N Tl ^ x y ^{X n ) — N T2 ^ x y ^{X n ) = 1, whereas 



7 



N T3 ( x ,y,z)(X n ) is equal to 1 if x < z and n = 2k + 1, or x = z and n = 2k + 2, and it 
is equal to otherwise. 

When we consider t^(x, y, z){X n ), we observe that since X 2 k+ 2 = 2W2 for some 
W, N T3 ( x y z ){X2k+2) = 0, whenever x ^ z. Also, since X 2 k+i = 1W3 for some W, 
N Ts(x,y,z)( x 2k+i) = 0, whenever x > z. 

Let us consider the pattern ts(2, 1,3) = [2-1^-3]. As it was mentioned before, 
-^r 3 (2,i,3)(^2fe+2) = 0. But, if we consider X 2 k+i = 1W3, then it is easy to see 
that 7V T3 ( 2 ,i.3)(A2fe+i) = 0, since the leftmost letter of X 2 k+i is the least letter, 
which means that it cannot correspond to the letter 2 in the pattern. As one more 
example, we can consider the pattern r 3 (l,l,2) = [1-1^-2] . We are only interested 
in case X n = X 2 k+\, since N T3 n 12 ) {X 2 k+ 2 ) = 0. The number of occurrences of 
the pattern is obviously given by the number of ways to choose t letters among 
4 2fe — 1 letters 1 (totally, there are 4 2fc letters 1 according to Lemma 1, but we cannot 
consider the leftmost 1 since it corresponds to the leftmost 1 in the pattern). Thus, 

N T3 (i.i,2)(X 2 k+i) = ( 4 f 1 )- 

All the other cases of x, y, z in the patterns T\{x,y), T 2 {x,y) and T^{x,y,z) can be 
considered in the same way. Let S\ and S 2 denote the following: 

/^2fc _ 2 2k \ /4 2fe \ /A 2k + 2 2k — l\ /4 2k + 1 \ /42/c+i _ 22fc+i 

Sl ={ I ) + + { * )' S2= { I ) + ( I 

The tables below give all the results concerning the patterns under consideration, 
except those triples (x,y, z), for which N T3 ( Xty ^(X n ) = for all n. 
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Figure 1: the Peano words 



